Importance Sampling for a Monte Carlo Matrix Multiplication Algorithm, with Application to Information Retrieval

نویسندگان

  • Sylvester David Eriksson-Bique
  • Mary Solbrig
  • Michael Stefanelli
  • Sarah Warkentin
  • Ralph Abbey
  • Ilse C. F. Ipsen
چکیده

We perform importance sampling for a randomized matrix multiplication algorithm by Drineas, Kannan, and Mahoney and derive probabilities that minimize the expected value (with regard to the distributions of the matrix elements) of the variance. We compare these optimized probabilities with uniform probabilities and derive conditions under which the actual variance of the optimized probabilities is lower. Numerical experiments with query matching in information retrieval applications illustrate that the optimized probabilities produce more accurate matchings than the uniform probabilities and that they can also be computed efficiently.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Monte Carlo Simulation of Multiplication Factor in PIN In0.52Al0.48As Avalanche Photodiodes

In this paper, we calculate electron and hole impactionization coefficients in In0.52Al0.48As using a Monte Carlo modelwhich has two valleys and two bands for electrons and holesrespectively. Also, we calculate multiplication factor for electronand hole initiated multiplication regimes and breakdown voltagein In0.52Al0.48As PIN avalanche photodiodes. To validate themodel, we compare our simulat...

متن کامل

A Stochastic algorithm to solve multiple dimensional Fredholm integral equations of the second kind

In the present work‎, ‎a new stochastic algorithm is proposed to solve multiple dimensional Fredholm integral equations of the second kind‎. ‎The solution of the‎ integral equation is described by the Neumann series expansion‎. ‎Each term of this expansion can be considered as an expectation which is approximated by a continuous Markov chain Monte Carlo method‎. ‎An algorithm is proposed to sim...

متن کامل

Randomized Approximation of the Gram Matrix: Exact Computation and Probabilistic Bounds

Abstract. Given a real matrix A with n columns, the problem is to approximate the Gram product AA by c ≪ n weighted outer products of columns of A. Necessary and sufficient conditions for the exact computation of AA (in exact arithmetic) from c ≥ rank(A) columns depend on the right singular vector matrix of A. For a Monte-Carlo matrix multiplication algorithm by Drineas et al. that samples oute...

متن کامل

Stopping-time resampling for sequential Monte Carlo methods

Motivated by the statistical inference problem in population genetics, we present a new sequential importance sampling with resampling strategy. The idea of resampling is key to the recent surge of popularity of sequential Monte Carlo methods in the statistics and engineering communities, but existing resampling techniques do not work well for coalescent-based inference problems in population g...

متن کامل

A Simplified Proof For The Application Of Freivalds' Technique to Verify Matrix Multiplication

Fingerprinting is a well known technique, which is often used in designing Monte Carlo algorithms for verifying identities involving matrices, integers and polynomials. The book by Motwani and Raghavan [1] shows how this technique can be applied to check the correctness of matrix multiplication – check if AB = C where A,B and C are three n×n matrices. The result is a Monte Carlo algorithm runni...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • SIAM J. Scientific Computing

دوره 33  شماره 

صفحات  -

تاریخ انتشار 2011